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•18/5/1 (Item 1 from file: 8) 

DIALOG (R) File 8 : Ei Compendex ( R) 

(c) 2004 . Elsevier. Eng.. Info. Inc . . All. rts . reserv. 

06090664 E.I. No: EIP02287014676 

Title: Building a topic -dependent maximum entropy model for very large 
corpora 

Author: Wu, Jun; Khudanpur^ Sanjeev 

Corporate Source: Ctr. for Language and Speech Proc. The Johns Hopkins 
University, Baltimore, MD 21218, United States 

Conference Title: 2002 IEEE International Conference on Acustics, Speech, 
and Signal Processing ^ 

Conference Location: Orlando, FL, United States Conference Date: 
20020513-20020517 

Sponsor: IEEE 

E.I. Conference No.: 59254 

Source: ICASSP, IEEE International Conference on Acoustics, Speech and 
Signal Processing - Proceedings v 1 2002. p I/777-I/780 (IEEE cat n 
02ch37334) 

Publication Year: 2002 

CODEN: IPRODJ ISSN: 0736-7791 

Language: English 

Document Type: CA; .(Conference. Articleji Treatment: T; (Theoretical.); X; 
(Experimental) 

Journal Announcement: 0207W3 

Abstract: Maximum entropy (ME) techniques have been successfully used to 
combine different sources of linguistically meaningful constraints in 
language models. However, most of the current ME models can only be used 
for small corpora, since the computational load in training ME models for 
large corpora is unbearable. This problem is especially severe when 
non-local dependencies are considered. In this paper, we show how to train 
and use topic-dependent ME Kjp^e^ls efficiently for a very large corpus. 
Broadcast News (BN) . The training time is greatly reduced by hierarchical 
training and divide-and-conquer approaches. The computation in using the 
model is also simplified by pre-normalizing the denominators of the ME 
model. We report new speech recognition results showing improvement with 
the topic model relative to the standard N-gram model for the Broadcast 
News task. 7 Ref s . 

Descriptors: *Linguistics ; Speech recognition; Mathematical models; 
Calculations; Probability; Iterative methods; Algorithms 

Identifiers: Maximum entropy; Large corpora; Broadcast news; Hierarchical 
training; Unigram frequency ; Generalized iterative scaling; Variant 
improved -iterative scaling - , 

Classification Codes: 

903.2 (Information Dissemination); 751.5 (Speech); 921.6 (Numerical 
Methods); 922.1 (Probability Theory) 

903 (Information Science); 751 (Acoustics, Noise & Sound); 921 
(Applied Mathematics); 922 (Statistical Methods) 

90 (ENGINEERING, GENERAL); 75 (SOUND & ACOUSTICAL TECHNOLOGY); 92 
(ENGINEERING MATHEMATICS) 

>.^' 

18/5/2 (Item 2 from file: 8) 

DIALOG (R) File 8 : Ei Compendex (R) 

(c) 2004 Elsevier Eng. Info. Inc. All rts. reserv. 

06013265 E.I. No: EIP02096873773 
Title: Latent semantic kernels 

Author: Cristianini, Nello; Shawe-Taylor, John; Lodhi, Huma 

Corporate Source: Department of Computer Science Royal Holloway 
University of London, Egham, Surrey TW20 OEX, United Kingdom 

Source: Journal of Intelligent Information Systems v 18 n 2-3 March/May 
2002. p. 127-152. . . . . , 

Publication Year: 2002 

CODEN: JIISEH ISSN: 0925-9902 

Language: English 

Document Type: JA; (Journal Article) Treatment: T; (Theoretical); X; 



(Experimental) 

Journal Announcement: 0203W1^^^ , . 

Abstract: Kernel methods like' support vector machines have successfully 
been used for text categorization. A standard choice of kernel function 
has been the inner product between the vector-space representation of 
two documents, in analogy with classical information retrieval (IR) 
approaches. Latent semantic indexing (LSD has been successfully used for IR 
purposes as a technique for capturing semantic relations between terms 
and inserting them into the similarity measure between two documents. One 
of its main drawbacks, in IR, Is its computational cost. In this paper we 
describe how the LSI approa^V 'can be implemented in a kernel-defined 
feature space. We provide -experimental results demonstrating that the 
approach can significantly improve performance, and that it does not 
impair it. 27 Refs. 

Descriptors: ^Semantics; Text processing; Information retrieval; Indexing 
(of information) ; Vectors 

Identifiers: Latent semantic indexing (LSI); Support vector machines 

Classification Codes: 

903.2 (Information Dissemination); 723.5 (Coitputer Applications) ; 903.1 
(Information Sources & Analysis); 921.1 (Algebra) 

903 (Information Science); 723 (Computer Software, Data Handling & 
Applications); 921 (Applied Mathematics) ... 

90 (ENGINEERING, GENERAL); 72 (COMPUTERS '& DATA PROCESSING) ; 92 
(ENGINEERING MATHEMATICS) 



18/5/3 (Item 3 from file: 8) 

DIALOG (R) File 8 : Ei Compendex(R) 

(c) 2004 Elsevier Eng. Info. Inc. All rts . reserv. 

04020633 E.I. No: EI P94 12^1^.74 92 

Title: Probabilistic algorithm for segmenting non-kanji Japanese strings 

Author: Teller, Virginia; Batchelder, Eleanor Olds 

Corporate Source: City Univ of New York, New York, NY, USA 

Conference Title: Proceedings of the 12th National Conference on 

Artificial Intelligence. Part 1 (of 2) 

Conference Location: Seattle, WA, USA Conference Date: 

19940731-19940804 

Sponsor: American Association for Artificial Intelligence 
E.I. Conference No.: 21453 

Source: Proceedings of the National Conference on Artificial Intelligence 
y 1 1994, AAAI,, Menlo, Park,. CA^^^USA. p 742-747 
Publication Year: 1994* 
CODEN: PNAIEE 
Language: English 

Document Type: CA; (Conference Article) Treatment: A; (Applications); T 

; (Theoretical) 

Journal Announcement: 9502W3 

Abstract: We present an algorithm for segmenting unrestricted Japanese 
text that is able to detect up to 98% of the words in a corpus. The 
segmentation technique, whi^Vi'^s simple and extremely fast, does not depend 
on a lexicon or any formal -notion of what a word is in Japanese, and the 
training procedure does not require annotated text of any kind. Relying 
almost exclusively on character type information and a table of hiragana 
bigram frequencies, the algorithm makes a decision as to whether to create 
word boundaries or not. This method divides strings of Japanese characters 
into units that are computationally tractable and that can be justified on 
lexical and syntactic grounds as well. (Author abstract) 17 Refs. 

Descriptors: ^Natural language processing systems; Algorithms; 
Probability; Knowledge based systems; Character recognition; Statistics; 
Learning systems 

Identifiers: Probabilistic algorithms; Non Kan ji Japanese strings; 
Segmentation; Lexicon; Hiragana bigram frequency ; Tagging 
Classification Codes: 
723.4.1 (Expert Systems) 

723.2 (Data Processing); 921.6 (Numerical Methods) ; 922.1 (Probability 
Theory); 723.4 (Artificial Intelligence); 922.2 (Mathematical Statistics) 



• 723 (Computer Software); 921 (Applied Mathematics) ; 922 (Statistical 
Methods) ' ...... w. 

72 (COMPUTERS & DATA PROCESSING); 92 (ENGINEERING MATHEMATICS) 



18/5/5 (Item 2 from file: 35) 

DIALOG(R) File 35 : Dissertation Abs Online 

(c) 2004 ProQuest Inf o&Learning. All rts . reserv. 

01892669 ORDER NO: AADAA-130^4/649 

A framework for description 7*" sharing and retrieval of semantic visual 

information 

Author: Park, Youngchoon 
Degree: Ph.D. 
Year: 2002 

Corporate Source/Institution: Arizona State University (0010) 

Adviser: Forouzan Goishani 

Source: VOLUME 63/05-B OF DISSERTATION ABSTRACTS INTERNATIONAL. 

PAGE 2461. 186 PAGES 
Descriptors: COMPUTER SCIENCE 
Descriptor Codes: 0984 

ISBN: " ■ 0'-493-69339-4 ... 

Multimedia data, in particular images and videos, need to be stored, 
classified and indexed based on the syntactic and semantic content. 
However, techniques for content-based image or video retrieval are not 
mature enough to detect and classify semantic objects completely. 

For the purpose of semantic object-based visual information 
management; a novel visual content description unit, called a visual term; 
a new content description schi^me of image objects with visual terms; and a 
novel contextual image matqHlng method are proposed and retrieval 
effectiveness of the proposed matching technique is demonstrated. 

A visual term, a symbolic representation of an atomic component of 
visual content (e.g., texture feature, shape feature and color property), 
is obtained from a set of training samples through various steps, including 
geometric primitive detection, color component extraction, texture property 
acquisition and hierarchical feature vector grouping. A collection of 
visual terms defines a quality dimension of an application and serves as a 
set of standardized vocabularies. 

The visual term frequency (i.e., the visual term histogram) of an 
image is a natural way to represent and compare two images . In 
addition, a visual term ^ccurrferfce ' vector *' (i : e . , a binary vector'ih which 
each component indicates the occurrence of certain visual terms in an 
image) can be efficiently used the contextual retrieval of images. The 
proposed image representation allows seamless integration of heterogeneous 
features without requiring complex normalizations and weighting mechanisms 
and improves the performance of semantic or visual content classification. 

A robust semantic object learning from the proposed content 
representation was developed for creating semantic object templates. A 
semantic model was obtained f^om quality samples and represented as a 
probability density functiqiS^ For the purpose of accurate estimation of 
model parameters from the samples, a similarity-based visual term frequency 

estimation and weighting scheme has been used. The results of mining 
characterized the contextual relations of visual terms and generalized 
visual appearances of a semantic model. Contextual information that was 
extracted from the correlation analysis of visual term co-occurrences was 
added into the original visual data for accurate model classification. 
Automated image classification naturally permits mapping a low-level 
feature to a high-level semantic concept managed in an ontology and 
provides ways to navigate the database content based on semantic 
similarities and relations . 

Experiments In this 'study' Kave 'showri "moire than 11% improvement on 
categorization of 1100 images into 11 semantic categories, and 
approximately 7% improvement on context-independent (i.e., visual 
appearance-based image matching) image retrievals compared to other image 
classification and retrieval approaches. 



18/5/6 (Item 3 from file: 35) 

DIALOG (R) File 35 : Dissertation Abs Online 

(c) 2004 ProQuest Inf o&Learning. All rts. reserv, 

01611916 ORDER NO: NOT AVAILABLE FROM UNIVERSITY MICROFILMS INT'L. 

A SEMANTIC TRANSFORMATION MODEL OF AUTOMATIC INDEXING (INDEXING, AMBIGUITY) 

Author: ZHENG, FEN 
Degree: PH.D. 
Year: 1997 

Corporate Sourc'e/Institutionr"* UNIVERSiTY OF NEW SOUTH WALES (AUSTRALIA) 

(0423) 

Source: VOLUME 58/10-A OF DISSERTATION ABSTRACTS INTERNATIONAL. 
PAGE 3766. 

Descriptors: LIBRARY SCIENCE ; INFORMATION SCIENCE ; ARTIFICIAL 

INTELLIGENCE 
Descriptor Codes: 0399; 0723; 0800 

SETRAN, a SEmantic TRANs^to'rmation model of automatic indexing, is 
designed to theoretically analyse the problems of ambiguity in subject 
representation on three layers: semantic interpretation, semantic 
presentation and pattern matching. Four major factors: semantic value, 
semantic relationship , syntactic relationship and semantic field, 
greatly affect automatic indexing by changing the different levels of text 
isolation and by posing a variety of ambiguity in subject representation. 

In semantic interpretation, it is found that the syntactic 
representation of subjects can be achieved by obtaining conceptual 
categories from title parsing. The similarity of syntax between titles and 
subjects depends on the precision of terms and the coverage of facets in 
titles. The recognition of phrase structures is the key to title parsing. A 
parser using a syntactic analysis with semantic analysis can he more 
effective and less ambiguous. Terms, links and relations are fundamental to 
word significance. 

In semantic representation f two issues are differentiated for 
retrieval purpose: (i) subject representation by text-based and by 
knowledge-based; and (ii) semantic relationships as linguistic 
relations and as factual relations. Approaches are suggested to control the 
problems of ambiguity in various relations such as synonymy, taxonomy, 
hierarchy, association, collo^a^tion, antonymy and homography. The 
incorporation of traditional thesauri as part of a knowledge base to 
support automatic subject representation is recommended. 

Pattern matching discusses the matching between the natural language 
of a text and its systematic language in the deep structure of concepts. 
The smallest semantic units affect the capability of index terms by 
determining the possibility of combination between terms. Such combination 
may change the pattern values which determine pattern matching modes. The 
deep structure-based online version of pre-defined indexing languages can 
greatly improve the expression of systematic languages in automatic 
indexing. Subject representation models confine the relational symmetry in 
pattern matching. Subject indexing and subject classification are referred 
to in different subject representation models. 



18/5/7 (Item 4 from file: 35) 

DIALOG (R) File 35 : Dissertation Abs Online 

(c) 2004 ProQuest Inf o&Learning. All rts. reserv. 

01572565 ORDER NO: AAD97-28810 

EXTERNALIZING A WORK DOMAIN S^IttJCTURE ON A HYPERTEXT INTERFACE USING AN 
ABSTRACTION HIERARCHY: SUPPORTING COMPLEX SEARCH TASKS AND PROBLEM- SOLVING 
ACTIVITIES 

Author: XU, WEI 
Degree: PH.D. 
Year: 1996 

Corporate Source/Institution: MIAMI UNIVERSITY (0126) 
Directors: MARVIN J. DAINOFF; LEONARD S. MARK 

Source: VOLUME 58/03-B OF DISSERTATION ABSTRACTS INTERNATIONAL. 



PAGE 1581. 122 PAGES 
Descriptors: PSYCHOLOGY, INDUSTRIAL ; ENGINEERING, INDUSTRIAL ; COMPUTER 
SCIENCE 

Descriptor Codes: 0624; 0546; 0984 

When searching in a complex hypertext application, users may 
experience navigation disorientation and cognitive overload. This study 
examined the relative effectiveness of a semantic representation for a 
hypertext interface and the reSrationship" between semantic 
representations and complexity of search task. The goal is to design a more 
effective hypertext interface for complex documents in support of users* 
complex and problem-solving tasks. 

The proposed interface was developed based on Rasmussen's abstraction 
hierarchy. A hypertext-based application with a complex document, called 
HyperErgo, was created as a research vehicle. The domain knowledge was 
ergonomic designs for computer workstation. Two interfaces were implemented 
based on two semantic repres^en tat ions : a classification hierarchy (CH) 
and the abstraction hierarchy^ (AH) . 

A 2 $\times$ 3 factorial design (two interfaces x three search tasks) 
was adopted for an experiment. The search tasks included a simple search 
task, a complex search task, and a problem-solving task. Twenty four 
college student were recruited. Four measures were taken: search time, 
time spent per node, deviation ratios, and subjective ratings. 

The results supported research hypotheses. When performing the complex 
or problem-solving task, subjects using AH spent less time and experienced 
less navigation disorientation than those subjects using CH. As the task 
complexity increased, the advantage of AH over CH increased as measured 
by search time, time spent per node, and deviation ratios. No difference 
was found between- the two i-nterf-aces for -the • simple task. Subjects using AH 
reported experiencing less navigation disorientation than those subjects 
using CH, 

This study suggests: (1) For hypertext applications with complex 
documents, AH would be a more effective semantic representation of an 
interface to support complex and problem-solving tasks as compared to CH; 
(2) There is a relationship between semantic representations and 
complexity of search task, as supported by the interaction between the type 
of interface and the type of search task; (3) When choosing a semantic 
representation of a hyperte^^interf ace for a complex document, designers 
should consider users' search tasks and work domains as well as externalize 
domain constraints and functional relations of work domains on the 
interface to support users' complex and problem-solving tasks. 



18/5/8 (Item 5 from file: 35) 

DIALOG (R) File 35 : Dissertation Abs Online 

(c) 2004 ProQuest Inf o&Learning. All rts. reserv. 

01443906 ORDER NO: AADAA-IC436643 

STATISTICAL LANGUAGE PROCEMINS^ -BASED ON ' SELF-ORGANISING WORD 
CLASSIFICATION 

Author: MCMAHON, JOHN GEORGE GAVIN 
Degree: PH.D. 
Year: 1994 

Corporate Source/ Institution: QUEEN'S UNIVERSITY OF BELFAST (NORTHERN 
IRELAND) (0725) 

Source: VOLUME 56/04-C OF DISSERTATION ABSTRACTS INTERNATIONAL. 

PAGE 1082. 208 PAGERS 
Descriptors: COMPUTER SCIflftcE 
Descriptor Codes: 0984 

An automatic word classification system has been designed which 
processes word unigram and bigram frequency statistics extracted from a 
corpus of natural language utterances. The system implements a type of 
simulated annealing which employs an average class mutual information 
metric. Resulting classifications are hierarchical, allowing variable class 
granularity. Words are represented as structural tags: unique n-bit numbers 
the most significant bit-patterns of which incorporate class information. 



Therefore, access to a structural tag iiranediately provides access to all 
classification levels for the corresponding word. The classification system 
has successfully revealed some of the structure of two natural languages, 
from the phonemic to the semantic level. The system has been favourably 
compared, directly and indirectly, with other word classification systems. 
Class based interpolated language models have been constructed to exploit 
the extra information supplied by structural tag classifications. 
Experiments measuring test §e^ -perplexity have shown that the new models 
significantly improve performance. 



18/5/10 (Item 7 from file: 35) 

DIALOG (R) File 35 : Dissertation Abs Online 

(c) 2004 ProQuest Inf o&Learning. All rts. reserv, 

1044580 ORDER NO: AADD — 84522 

COLLATIVE SEMANTICS: A SEMANTICS FOR NATURAL LANGUAGE PROCESSING 

Author: FASS, DAN 

Degree: PH.D. ^ 

Year: 1988 

Corporate Source/Institution: UNIVERSITY OF ESSEX (UNITED KINGDOM) (0873 

) 

Source: VOLUME 49/12-B OF DISSERTATION ABSTRACTS INTERNATIONAL. 

PAGE 5398. 400 PAGES 
Descriptors: COMPUTER SCIENCE; LANGUAGE, LINGUISTICS 
Descriptor Codes: 0984; 0290 

Available from UMI in ^^o^ciation with The British Library. Requires 
signed TDF. 

A semantics for natural language processing called Collative Semantics 
(CS) is proposed. The main semantic phenomena it addresses are lexical 
ambiguity and semantic relations . Seven kinds of semantic relation 
are distinguished: literal, metonymic, metaphorical, anomalous, redundant, 
inconsistent, and novel relations. 

CS contains four components: two representations / sense- frames and 
semantic vectors and two processes, collation and screening. Sense-frames 
are a knowledge representation scheme which represents individual word 
senses. In sense-frames, word senses also perform the functions of semantic 
primitives. Collation matches the sense-frames of two word senses and 
discriminates the semantic relations between them as a complex system 
of mappings between their sense-frames. Semantic vectors represent the 
systems of mappings produced by collation and hence the semantic 
relations encoded in those mappings (except for metonymic relations). 
Screening chooses between two semantic vectors by applying rank orderings 
among semantic relations and a measure of conceptual similarity, 
thereby resolving lexical ambiguity. 

The foundations of CS rest on three main theoretical ideas. The first 
is a linguistic view of kno^^dge representation which, by imposing 
restrictions on natural language and relaxing controls on knowledge 
representation, reaches a point where the semantic primitives of a 
knowledge representation function like senses of words from natural 
language, as in sense-frames. 

The second idea is a distinction made between knowledge and coherence; 
the third is a large-scale framework consisting of four constructs, two 
representations and two processes, in which coherence plays a major 
organising role. The four components of CS are instances of these four 
constructs. The framework distinguishes between the representation of 
knowledge and coherence. Semantic vectors are an example of the latter. 

CS.has been implemented in^^a computer program called metaS which 
analyses sentences, discriminates the seven kinds of semantic relation 
between pairs of word senses in those sentences, and resolves any lexical 
ambiguity. 
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DIALOG (R) File 202: Info. Sci. & Tech. Abs. 
(c) 2004 EBSCO Publishing. All* rts. reserv. 



3101754 

Effect of animation in enhancing descriptive and procedural texts in a 
multimedia learning environment. 

Author(s): Large, A; Breuleux, A; Renaud, A 
Corporate Source: McGill Univ., Montreal, Quebec 

Journal of the American Socie^^^ for Information Science vol. 47, no. 6 

, pages 437-448 

Publication Date: Jun 1996' 

ISSN: 0002-8231 

Language: English 

Document Type: Journal Article 

Record Type: Abstract 

Journal Announcement: 3100 

This article reports the third and final phase of a research project to 
investigate the role of animation in enhancing recall and comprehension of 
text by grade 6 primary school students. This phase had three objectives: 
To determine whether a • complex descriptive • text is enhanced by* animation 
so long as the animation exhibits close semantic links an animation 
with a text so as to increase comprehension of that text; and to 
investigate the relationship between students* spatial skills and their 
ability to recall and comprehend a text enhanced with still images and 
animation. A descriptive text on the structure and functions of the heart 
from Compton*s Multimedia Encyclopedia was linked to a still image and 
two animation sequences developed by the research team and which were both 
more extensive and more complet^ely integrated semantically with the text 
than in the original Compton*ls version. Four presentation conditions were 
produced: Text; text and still image; text, still image, and animations, 
and captions. Students were tested for spatial ability and divided into two 
groups: Low and high spatial ability. Their comprehension was tested using 
three tasks: Written recall, multiple choice questions, and 
problem-solving. Animation improved significantly only the problem-solving 
task, but this was the measure which involved the highest level of 
cognitive effort. Students with high spatial ability in general performed 
better than students with low spatial ability regardless of presentation 
condition, and in the case of propositional and thematic recall, this was 
significant . 

Descriptors: Animation; Educational technology; Elementary schools; 
Learning 

Classification Codes and Description: 3.05 (Instruction); 2.06 
(Psychological Aspects, Cognition); 5.08 (Graphics and Displays) 

Main Heading: Information Generation and Promulgation; Research Methods; 
Information Processing and Control 

18/5/14 (Item 2 from f^h: 202) 

DIALOG (R) File 202: Info. Sci. & Tech. Abs . 
(c) 2004 EBSCO Publishing. All rts . reserv. 

0800564 

Handling of varied data bases in an information center environment. 

Book Title: 1971 July 23. Research Institute, Illinois Institute Of 
Technology, Chicago. 24 P. 0 Ref. Edrs: Ed-067 143; He $3.29, Mf $0.65. 
Presented At The Conference On Computers In Chemical Education And 
Research, Northern Illinois University, Dekalb, Illinois. 
Author(s): Williams, Martha E 

■Publication Date:- 1971 ... 

Language: English 
Document Type: Book Chapter 
Record Type: Abstract 
Journal Announcement: 0800 

Information centers exists to provide information from machine- readable 
data base to users in industry, universities, and other organizations. The 
computer search center of the. i^t research institute was designed with a 



flumber of variables and uncertainties before it. In this paper, the author 
discusses how the center was designed to enable it to accoiranodate the many 
variables it would face in providing different services to diverse users. 
The system design is discussed in terms of the unpredictable future and in 
terms of the user to be served. User aids that were developed (search 
manual, truncation guide, freauency lists, key letter in context (klic) 
index and bigram frequen«y^ 'list) , and communication with the user are 
discussed. The reasons for 'using a selective dissemination of information 
service (sdi) are presented. 

Classification Codes and Description: 7.01 (Planning, Administration) 
Main Heading: Libraries and Information Services 
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DIALOG (R) File 2:INSPEC 

(c) 2004 Institution of Electrical Engineers. All rts. reserv. 

7855519 INSPEC Abstract Number: C2004-03-7330-119 

Title: Automated enhancement of description logic-defined terminologies to 
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Abstract: Compositional (post-coordinated) terminologies are one 
potential solution to the problem of content completeness. However, they 
have the potential to render data incomparable. For computers to determine 
that compositional expressions are comparable, the relations between the 
•composed- compon^n^ts that .ar-e understood implicitly by human readers must be 
represented explicitly for computer manipulation. We discuss a technique 
for discovering and formalizing the implicit semantic relationships in 
two vocabularies: the International Classification of Disease Version 9 
Clinical Modification (ICD9-CM), and SNOMED-Ref erence Terminology 
(SNOMED-RT) . The results of this technique are used to augment the existing 
SNOMED-RT relation ontology (K.A. Spackman et al., 1998), which is a 
necessary step in automated concept mapping between systems. The reference 
terminology must contain all the semantics implicit in the classification 
in order to map concepts^ ^between the two representations . We also 
provide an explicit representation of the implied semantics of ICD9-CM. 
This tabulation will be useful for other knowledge engineering efforts 
involving ICD9-CM. (26 Refs) 
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Treatment: Theoretical (T) ; Experimental (X) 

Abstract: Experiments were conducted to examine an approach from language 
modeling side to improving noisy speech recognition performance. By 
adopting appropriate word strings as new units of processing, speech 
recognition performance was • improved by acoustic effects as well as by 
test-set perplexity reduc^^. Three kinds of word string language models 
were evaluated, whose addxtional lexical entries were selected based on 
combinations of part of speech information, word length, occurrence 
frequency, and log likelihood ratio of the hypotheses about the bigram 

frequency . All of the three word string models reduced errors in 
broadcast news speech recognition, and also the lowered test-set 
perplexity. The word string model based on log likelihood ratio exhibited 
the best improvement for noisy speech recognition, by which deletion errors 
were reduced by 26%, substitution errors by 9.3%, and insertion errors by 
13%, in the experiments using the speaker-dependent, noise-adapted 
triphone. Effectiveness of word string models on error reduction was more 
prominent for noisy speech than^^f or studio-clean speech. (23 Refs) 
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Conference Title: Proceedings of Second International ICSC Symposium on 
Fuzzy Logic and Application^^' 

Conference Date: 12-14 Feb. 1997 Conference Location: Zurich, 
Switzerland 

Language: English Document Type: Conference Paper (PA) 
Treatment: Theoretical (T) 

Abstract: This paper describes the application of fuzzy logic concepts to 
model semantic variables in the knowledge engineering phase, during a 
hybrid expert system application. The fuzzy logic is a suitable approach to 
handle imprecise concepts, such as predicates, quantifiers and predicate 



modifiers. The system developed named SECOX-HI, is a hybrid expert system 
model concerning the applications in classification problems. Hybrid 
architectures for expert. ^ systems (ES) employ two knowledge 
representation mechanism>*%che symbolic and connectionist approaches. 
The hybrid expert system was applied in a decision task, the classification 
of operation states of the floodgates movements from a reservoir, in a 
hydroelectric power station. In the role of to determine the occurrence 
of the operation states, several information must be considered as well the 
semantic predicates associated to them. These predicates are quantifiers 
such as high, medium and low. The fuzzy logic concepts was used to model 
the fuzzy variables of the application domain and to establish a confidence 
degree of each variable. This confidence degree perform the input layer of 
the neural network model. The paper presents the description of the problem 
domain, the hybrid expert system model and the detezxaination of the fuzzy 
•sets and- their membership f-uncti-ons * (IT -Refs) • • 
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Abstract: An automatic word-classification system has been designed that 
uses word unigram and bigram frequency statistics to implement a binary 

top-down form of word clustering and employs an average class mutual 
information metric. Words are represented as structural tags-n-bit numbers 
the most significant bit-patterns of which incorporate class information. 
The classification system has revealed some of the lexical structure of 
English, as well as some phonemic and semantic structure. The system has 
been compared-directly and indirectly-with other recent word-classification 
systems;* We see- our ela-ssif i-cation as a means towards the end of 
constructing multilevel class-based interpolated language models. We have 
built some of these models and carried out experiments that show a 7% drop 
in test set perplexity compared to a standard interpolated trigram language 
model. (62 Refs) 
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Abstract: The article reports on a research project to investigate the 
role of animation in enhancing^ recall and comprehension of text by grade 6 
primary school students . ^"^ne project had three objectives: to determine 
whether a complex descriptive text is enhanced by animation so long as the 
animation exhibits close semantic links with the text; to explore the 
importance of captions in linking an animation with a text so as to 
increase comprehension of that text; and to investigate the relationship 
between students' spatial skills and their ability to recall and comprehend 
a text enhanced with still images and animation. A descriptive text on the 
structure and functions of the heart from Compton's Multimedia Encyclopedia 
was linked to a still image and two animation sequences developed by 
the research team. Four presentation conditions were produced: text; text 
and still image; text, still image, and animations; and text, still image, 
•animations, and - captions.- Students were 'tested for spatial ability and 
divided into two groups: low and high spatial ability. Comprehension was 
tested using three tasks: written recall, multiple choice questions, and 
problem solving. Animation improved significantly only the problem solving 
task, but this was the measure which involved the highest level of 
cognitive effort. Students with high spatial ability in general performed 
better than students with low spatial ability regardless of presentation 
condition, and in the case of prepositional and thematic recall, this was 
significant. The addition of, captions to the animation sequences had no 
significant effect but this may be because the sequences- also included 
labels which could have obviated the need for captions. (12 Refs) 
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Abstract: Similarity-based retrieval of images is an important task in 
many image database applications. A major class of users' requests requires 
retrieving those images in the database that are spatially similar to the 
query image. We propose an algorithm for computing the spatial similarity 
between two symbolic images • A symbolic image is a logical 
representation of the original image where the image objects are uniquely 
labeled with symbolic names. Spatial relationships in a syxnbolic 
image are represented as edges in a weighted graph referred to as 
spatialyorientation graph. Spatial similarity is then quantified in terms 
of the number of as well * a s the extent to which, the edges of the 
spatial-orientation graph of the database image conform to the 
corresponding edges of the spatial-orientation graph of the query image. 
The proposed algorithm is robust in the sense that it can deal with 
translation, scale, and rotational variances in images. The algorithm has 
quadratic time complexity in terms of the total number of objects in both 
the database and query images. We also introduce the idea of quantifying 
a system's retrieval quality by having an expert specify the expected rank 
ordering with respect to each^^c^uery for a set of test queries. This enables 
us to assess the quality of algorithms comprehensively for retrieval in 
image databases. The characteristics of the proposed algorithm are compared 
with those of the previously available algorithms using a testbed of 
images. The comparison demonstrated that our algorithm is not only more 
efficient but also provides a rank ordering of images that consistently 
matches with the expert's expected rank ordering. (10 Refs) 
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Abstract: If pictures or sequences of pictures are to be interpreted^ 
some means of obtaining a symbolic description of the tone or colour of 
pictures must be the basis of such a method of interpretation. Such 
symbolic descriptions can be formalised to indicate the properties of the 
picture symbols or to indicate relationships between picture symbols. The 
interpretation of picture sequences involves two essential processes: 
the grouping of picture symbols to describe objects and the production of 
correspondence relationships ^ between the symbolic descriptions of a 
number of pictures in a sequence. The amount of data required to evaluate 
picture sequence is, however, so large that it can only be systematically 
managed with some type of 'background' storage. The author, therefore, 
discusses the problems involved in the first stage in achieving; a data 
base system for pictures, the special purpose of which will be to make 
available, to the user, picture descriptive relationships which will enable 
him to have access to any picture he requires and, also, to help him in the 
analysis of pictures by means of relationships stored in the data base. ( 
10 Refs) 
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ABSTRACT: The third and final phase of a research project to investigate 
the role of animation in enhancing recall and comprehension of text by 
grade 6 primary school students is reported. This phase had three 
objectives: 1. to detenpWe whether a complex description text is 
enhanced by animation sc5 long as the animation exhibits close semantic 

links with the text, 2. to explore the importance of captions in linking 
an animation with a text so as to increase comprehension of that text, and 
3. to investigate the relationship between students' spatial skills and 
their ability to recall and comprehend a text enhanced with still images 
and animation. A descriptive text on the structure and functions of the 
heart from Compton * s Multimedia Encyclopedia was linked to a still ima^'e 

of and two animation sequences developed by the research team and which 
were both more extensive and more completely integrated semantically with 
the text than in the original Compton *s version. 
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. . .ABSTRACT: of text by grade 6 primary school students is reported. This 
phase had three objectiyW: 1. to determine whether a complex 
description text is enhanced by animation so long as the animation exhibits 
close semantic links with the text, 2. to explore the importance of 

captions in linking an animation with a text... 

... the structure and functions of the heart from Compton ' s Multimedia 
Encyclopedia was linked to a still image of and two animation sequences 
developed by the research team and which were both more extensive and more 
completely integrated. . . 
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character string to one tjjp^ -'of symbolic character determined in 
correspondence to one general character adjacent to the special 
character. Also, occurrence frequencies of the general or symbolic 
characters of each chain are counted and stored in a recording medium 
with the registration character chains. When a retrieval character chain 
is input, occurrence frequencies of particular character chains 
corresponding to all retrieval character chains detected from the 
retrieval character string in the same manner are read out from the 
recording medium and are collated with each other, and a particular 
character string agreeing with the retrieval character chain is retrieved 
from the registration character string. Because an occurrence frequency 
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.SPECIFICATION pair of occurrence frequencies of the fore general 
character and the rear general character of each general two -character 
chain recorded in the character chain recording region as an occurrence 
frequency set, recording an occurrence ... in the retrieval character 
string is converted into a type of symbolic character according to the 
general- symbolic character type relationship in the same manner as 
the conversion of the special characters arranged in the registration 
character string. . ' ... 

.chains of the series of particular two-character chain types arranged in 
the particular chain order is determined by collating a plurality of 
particular position numbers of the particular registration two-character 
chains with each ... character arranged in the retrieval character string 
is converted into two symbolic characters according to the general- 
symbolic character type relationship in the same manner as the 
conversion of each special character arranged in the registration 
character string... 

.chains of the series of particular two-character chain types arranged in 
the particular chain order is determined by collating a plurality of 
particular position numbers of the particular registration two-character 
chains with each. . .type of the general character "E" placed just after 
the special character. Fig. 7B shows a general- symbolic character type 
relationship . For example, the special character is replaced with a 
symbolic character "F (circumflex) " in cases where a... string, converting 
each detected special character into a third symbolic character 
determined according to the same general- symbolic character type 
relationship to produce a converted retrieval character string; 

a retrieval two-character* chain detecting unit 64 for detecting. a 
converted registration character string 

"AB — CDD (circumflex) E (circumflex) EF — GH". Fig. 9B shows a general- 
symbolic character type relationship . For example, in cases where one 
special character "ae" is placed between a general character "A" and. . . 

.AA(circumflex) Cg (circumflex) Cg" or "CgCg (circumflex) A( circumflex) A" . The 
symbolic character "A ( circumflex) " is determined according to the 
character "A", and the symbol^ "Cg ( circumflex) " indicates a symbolic 
character determined ac^rding to the arbitrary general character "Cg" 
such as "A", "B", — or "Z". Also, in cases where... 

. circumflex) Cg (circumflex) Cg" or "CgCg (circumflex) B (circumflex) B" by 
using the symbolic character "B (circumflex) " determined according to 
the character "B" . 

Thereafter, in the detecting unit 62, a plurality of registration 
two-character. . . 
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ABSTRACT EP 327266 A2 

There are disclosed methods and applications for determination of parts 
of speech (part-of-speech tagging) and noun phrase parsing for text or 
other non-verbal record of a communication. The part-of-speech tagging 
method optimizes the product of individual word lexical probabilities and 
normalized three-word contextual probabilities. Normalization involves 
dividing by contained two-word contextual probabilities. The method for 
noun phrase parsing involves optimizing the choices of, typically 
non-recursive, noun phrases by considering all possible beginnings and 

• endings thereof., preferably based. on. the output of the part-of-speech 
tagging method. The disclosed applications include text-to-speech 
synthesis and text searching and editing, among a broader range of 
possible applications. 
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..SPECIFICATION of speech X given the succeeding parts of speech Y and Z, 
already determined, divided by the " bigrazn " frequency of observing 
part of speech Y given part of speech Z. The latter two data can be... 



...SPECIFICATION of speech X^Viven the succeeding parts of speech Y and Z, 
already determined, divided by the bigram " frequency of observing 
part of speech Y given part of speech Z. The latter two data can be... 

...CLAIMS all parts of speech, and 

the contextual probability is estimated by dividing the trigram 
frequency by the blgrazn frequency / where 

the trigram frequency is the frequency of occurrence of the 
particular part of speech in sequence with the two following parts of 
speech, as already determined for the two following words, and 

the bigram frequency is the frequency of occurrence of the 
particular part, of .speechwOf . the following word in sequence... . 

...CLAIMS all parts of speech, and 

the contextual probability is estimated by dividing the trigram 
frequency by the bigram frequency / where the trigram frequency is 
the frequency of occurrence of the particular part of speech in 
sequence with the two following parts of speech, as already 
determined for the two following words, and the bigram frequency 
is the frequency of occurrence of the particular part of speech of 
the following word in sequence. . . 
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English Abstract 

A system, including software components, that efficiently and dynamically 
analyzes changes to data sources, including application programs, whithin 
an integration environment and simulatneously re-codes dynamic adapters 
between the data sources is disclosed. The system also monitors at least 
two of said data sources to detect similarities (3) within the data 
structures of said data s.ourp^s and .generates new dynamic adapters tp 
integrate said at least two of said data sources. The system also 
provides real time error validation of dynamic adapters as well as 
performance optimization of newly created dynamic adapters that have been 
generated (5) under changing environmental conditions. 

French Abstract 

L' invention porte sur un systeme comprenant des composants logiciels et 
qui analyse de maniere puiss'ante et dynamique des changements affectant 
des sources de donnees, y^Wpris des programmes d' application, dans un 
environnement d * integration et qui recede simultanement des adaptateurs 
dynamiques entre les sources de donnees, Le systeme surveille egalement 
au moins deux des sources de donnees considerees pour detecter des 
similitudes (3) dans les structures de donnees de ces sources de donnees 
et genere de nouveaux adaptateurs dynamiques pour integrer au moins deux 
des sources de donnees considerees. Le systeme assure egalement un 
controle d'erreur en temps reel des adaptateurs dynamiques, ainsi que 
1 * optimisation de puissance des adaptateurs dynamiques recemment crees 
qui ont ete generes (5) dans des conditions d* environnement evolutif. 

.Legal Status (Type, Date,. T.ext)_. . ... . ... 

Publication 20030724 Al With international search report. 

Examination 20031030 Request for preliminary examination prior to end of 

19th month from priority date 

Fulltext Availability: 

Claims 

Claim 

between two data sourc^V 'a 
process for developing dynamic adapters including the steps of: 
before an integration plan between said two data sources has been 
generated, an App2App Similarity Mapper determining the similarities 
between said two data sources and informing a Plannersof tware component 
to generate a new plan. .. Common Ontology library, the process comprising 
the steps 
of: 

monitoring each of said data sources by an Assessment Micro Agent 

including a Schema Manager; 

said Assessment Micro Agent creating an inventory of the data 
structures and func.tiorjaliti^a. of .said .dat^ sources and making, said , 
inventory available to predetermined 

1 ones of said other components of said system; 

said Assessment Micro Agent detecting a change in either of said data 
sources and notifying at least some of... devices of claim 123 where said 
process includes the further step of the Schema Manager of said 
Assessment Micro Agent reading the data structure stored in a data 
source to produce a schema that is... more processor readable storage 
devices of claim 123 where said process includes the further step of the 
Assessment Micro Agent, ipVi^esponse to a change in a monitored data 
source, detecting alterations including new information. the process 
comprising the steps of: 

converting the schema obtained from the Schema Manager component of the 

Assessment Micro Agent into a language compatible to the Common 
Ontology; mapping schema element identifiers to a WordNet...to ontology 
mapping for each data source being 



jaapped by evaluating the ijia^ematical probabilities of lexical and 
semantic relationships ' 

between schema entities and ontology concepts; 

1 0 determining lexical closeness between the data source ontology and 
I 1 Common Ontology concepts using synonym relationships; 

1 2 determining mathematical closeness of semantic relationships in 
the form of 
1 3 hypernyms; and 
determining confidence factors based on the . . • process includes the 

further steps of: 

comparing the data source ontologies of the monitored data sources to 

deterinine common concepts; ^ ^ 
mapping a data source ontology to another data source ontology using 
synonym and hypernym relationships... 

...sample of data element values from each said data sources and 
comparing said data elernent-values to determine mathematical 
closeness; 

validating expected data values for said data source * ontology mappings; 
1 0 composing and decomposing semantic relationships between target 
and ^ 
1 1 source data source ontology elements; and 

1 2 uniting semantically similar schema. . .between changed data sources, 
said process for generating dynamic 
adapters including the steps of: 

after an integration plan between two data sources has been 
generated, an Assessment Micro Agent determining that one of said 
data source's data structure has changed and, in response to said 
detecting ... to ontology mapping for each data source being 
mapped by evaluating the mathematical probabilities of lexical and 
semantic relationships 

between schema entities and ontology concepts; 

determining lexical closeness between the data source ontology and 
Common Ontology concepts using synonym relationships; 

1 1 determining mathematical closeness of semantic relationships in 

the form of 
hypernyms ; 

determining confidence factors based on the mathematical probability of 
said data source ontology and said Common Ontology being... 

...process includes the fur^^r' steps of: 

comparing the data source ontologies of the monitored data sources to 

determine common concepts; 
mapping a data source ontology to another data source ontology using 
synonym and hypernym relationships comparing said data element values to 
determine mathematical closeness; 

validating expected data values for said data source ontology mappings; 
1 0 composing and decomposing semantic relationships between target 

and 

1 1 source data source ontology elements; and 

1 2 uniting semantically similar schema steps of: 

before. an integration plan ^between said two data sources has been 
generated, an App2App Similarity Mapper determining the similarities 
between said two data sources and informing a Planner software component 
to generate a new. .. processes . 154. A process of managing revision in a 
data source including the steps of: 

connecting an Assessment Micro Agent to a data source; 

using the Schema Manager, extracting information about the data source; 

using - . . 
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French Abstract 

La presente invention Corfcerh^* uh 'syst^i¥ie informatique de transaction 
entre entreprises qui dans un mode de realisation prefere est destine a 
fournir des services de location de vehicules pour des utilisateurs a 
demande elevee comportant un portail de reseau Internet grace auquel 
1 'utilisateur a demande elevee peut acceder a une pluralite de 
fournisseurs de services comportant un reseau informatique d*entreprise 
integre pour au moins un fournisseur de services de location de 
vehicules. Le reseau informatique de fournisseur de services de location 
de vehicules est configure ^Qur 1 * interconnexion d*une pluralite de 
succursales de diversite .§^ographique, presentant le catalogue de leurs 
vehicules de location disponibles et des programmes les concernant ainsi 
que pour la gestion de toutes les donnees de transaction concernant son 
entreprise. Le portail de reseau Internet permet une connectivite et une 
transf erabilite universelles pour une association d* entreprises a 
plusieurs niveaux qui placent regulierement des demandes elevees d' achat 
de location avec son associe commercial et egalement les autres 



.fournisseurs de services 'qui peuvent ou non avoir le meme systeme et 
logiciel inf ormatique d * entreprise integre. L * utilisation du precede et 
de I'appareil de la presente invention permet de placer, de grands 
volumes de transactions de location, de les controler, de les modifier en 
cours d* operation, et de les conclure avec des operations de comptabilite 
financiere et paiement pratiquement sans intervention humaine. 
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French Abstract 

L' invention concerne un systeme, un precede, et un article manufacture de 
gestion proactive mis en ®e&vre au cours de la maintenance et de 
I'entretien d*un environriement du type chaine d' approvisionnement 
reseautee. Les appels telephoniques, les donnees et autres informations 
multimedia sont routes via un reseau assurant le transfert des 
informations via Internet au moyen d* inf ormations de routage telephonique 
et d' informations d*adresse de protocole Internet. Ledit reseau comprend 
un gestionnaire de seuil proactif qui avertit a 1 ' avance les fournisseurs 
d*une rupture de contrat imminente. Ledit gestionnaire de seuil proactif 



envoie une alarme au fournisseur de services lorsque le niveau de service 
du moment n* attaint plus le niveau de service determine dans le contrat 
en termes de maintien d*un certain niveau de service. 
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Detailed Description ' 

a chart that can be utilized to determine whether to use Netcentric 
technology; Figure 88 is a chart that can be utilized to determine 
whether to use Client Server teclinology; Figure 89 is a chart. . .user may 
be permitted to customize the fonnat of the display catalog for his or 
her particular session / or the custornizations may be saved so that the 
user's personalized settings are used each time... 
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Detailed Description 

order, is generally handled in an organized 5 manner using layered 
communication architectures. Such architectures address the two 
portions of the communications problem, one being that the delivery of 
data by an end user to ...the other being that the delivered data must 
be recognizable and in proper form for use. These two portions are 
handled by protocols, or standard conventions for communication 
intelligently, the first by network protocols and, . . 

•...and network -layers., and -the. other four -providing services to the end 
user by means of transport, session t presentation, and application 
layers, from lowest to highest layer. 

X.25 is an interface organized as a... of, and/or implements a negotiation 
process between, two or more sets of control information submitted by 
two or more parties. WAF also accommodates a semiautomated process 
during which one or more WAF participants directly. . . 
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English Abstract 

A universal data and software structure and method for an 7Vny-to-Any 
computing machine in which any number of any components can be related to 
any number of any other components in a manner that is not intrinsically 
hierarchical and is intrinsi.cally unlimited. The structure and method 
includes a Concept Hierarcha^;^ each concept or assembly of concepts is 
uniquely identified and aS^lgned a number in a Numbers Concept Language 
or uniquely identified in a Non-numbers Concept Language. Each Component 
or assembly of Components is intrinsically related to all other data 
items that contain common or related components. 

French Abstract 

L' invention concerne une structure de donnees et de logiciel universelle 
ainsi qu ' un procede de machine informatique toute categorie dans laquelle 
des composants, quels qu*ils soient et quel que soit leur nombre, peuvent 
etre rattaches a d'autres composants^ quels qu'ils soient et quel que 
soit leur nombre, d'une maniere intrinsequement non hierarchisee et 
intrinSequeme'nt' illlmitee*. La^"stracture"et * le procede comportent" une* 
hierarchie conceptuelle ; chaque concept ou ensemble de concepts est 
identifie de maniere unique et recoit un numero dans un langage 
conceptuel de norabres ou dans un langage conceptuel de non-nombres. 
Chaque composant ou ensemble de composants est intrinsequement rattache a 
tous les autres elements de donnees qui contiennent des composants 
communs ou associes. 
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Claims 

Claim 

Any-to-Any machine has a further method that enables a computer to use 
the recorded assembly plan to assemble the 'letter* on demand. Because 
these methods' enable assembly* plans to 'be stored and used... 

. . .The methods of the Any-to-Any machine provide for: 

179 

I Creating and storing an assembly plan for each item. Therefore, if 
the 

name 'John Brown' is part of an item, one or more... 
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SYSTEME, PROCEDE ET ARTICLE MANUFACTURE POUR DETERMINER LES NIVEAUX DE 
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D< EVALUATION DE PROCESSUS DANS UNE ETUDE DE MATURITE OPERATIONNELLE 
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French Abstract ^ 

L* invention concerne un sjigbeme, un procede et un article manufacture qui 
permettent de determiner 'les niveaux de capacite d'une zone de processus 
de surveillance lors de 1* evaluation de la maturite d'une organisation 
d ' operations , En premier lieu, on definit une pluralite d*attributs de 
processus. Pour chacun de ces attributs, on determine une pluralite de 
pratiques generiques . Ces pratiques generiques comportent notamment des 
pratiques de base, par exemple: verification d*un etat courant, collecte 
et documentation d * informations de surveillance, classification 
d* evenements, attribution de degres de gravite, evaluation d*impact, 
analyse de fautes, acheminement de fautes a corriger, mise en 
correspondance de types d'evenements par rapport a un diagnostic 
predefini et/ou, des procedurea. correctives, enregistrement des. evenements 
localement et/ou a distance, suppression de messages jusqu'a ce que des 
seuils soient atteints, affichage des informations d*etat sur au moins 
une console en plusieurs formats et a plusieurs emplacements, emission 
d*ordres sur des processeurs a distance, installation et changement de 
filtres locaux et/ou a distance, installation et changement de programmes 
de seuils locaux et/ou a distance, analyse du courant de trafic et envoi 
de messages radiodif fuses . On calcule alors une maturite d*une 
organisation d* operations sur la base, du moins en partie, de la 
realisation des pratiques ^generiques . . 
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Detailed Description 

is a Generic Practice and is applicable to all processes. Service 
Level .lylanagernent and Migrati9n. Control . are . two different Process Ajeas 
with different Base Practices, goals, and purposes. However, they share 
the same Generic Practice ... everyone involved. 

Gather Data 

Data for the assessment are obtained from several sources: responses to 
the maturity questionnaires , interview sessions , work products, and 
document reviews. Documents are reviewed in order to verify compliance. 
Interviewing provides ^ 
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English Abstract 

The present invention provides a facility for identifying the unknown 
language of text represented by a series of data values in accordance 
with a character set that associates character glyphs with particular 
data values. The facility first generates a characterization that 
characterizes, the series of ^ata . values ..in , terms of the occurrence ojf 
particular data values on the series of data values. For each of a 
plurality of languages, the facility then retrieves a model that models 
the language in term of the statistical occurrence of particular data 
values in representative samples of text in that language. The facility 
then compares the retrieved models to the generated characterization of 
the series of data values, and identifies as the distinguished language 
the language whose model compares most favorably to the generated 
characterization of the series of data values. 

French Abstract 

La presente invention concerne une f onctionnalite permettant 
d* identifier la langue non connue d'un texte represente par une serie de 
valeurs de donnees conformement a un jeu de caracteres qui associe les 
glyphes de caracteres a des valeurs de donnees specif iques. La 
f onctionnalite genere d'abord une caracterisation qui caracterise la 
serie de valeurs de donnees selon 1' occurrence de valeurs de donnees 
specif iques dans la serie de valeurs de donnees. Pour chaque langue parmi 
plusieurs langues, la f onctionnalite extrait alors un modele qui modelise 
la langue selon 1* occurrence statistique de valeurs de donnees 
specifiques d,ans des echanti^J-^pns . representatif s de texte . dans . cette. 
langue. La f onctionnalite compare ensuite les modeles extraits a la 



caracterisation generee de la serie de valeurs de donnees et identifie 
coitime langue du texte la langue dont le modele obtient les meilleurs 
resultats de comparaison avec la caracterisation generee de la serie de 
valeurs de donnees. 

Fulltext' Availability: * ' " " 
Detailed Description 

Detailed Description 

15 facility reduces 256 source values to only 129 target values, 
thereby reducing storage requirements for unigram . frequency 
statistics by nearly 50%. 

In step 405, the facility sj^ras the one-dimensional characterization of 
the sample... 3. This mappiTig reduces 256 source values to 155 target 
values, thereby reducing the storage requirements for bigram frequency 

statistics by over 63%. 
In step 407, the facility sums the two-dimensional characterization of 
the sample . - . 

...for this combination of language and character set. For example, because 
step 406 specifies the use of bigram frequency statistics, the 
facility constructs a two-dimensional matrix when performing step 502. 

In each of its dimensions... 

. . .of 

the specified length among the series of 
501. That is, where bigrazn frequency 
facility begins by considering the first 
serxes. . . 
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English /abstract 

A universal epistemological machine (U.M.) enables arbitrary synthetic 
forms of existence (that is, thinking machines) known as androids, which 
know arid perceiVe the world do 'humah 'beings . The U.M. embodies 
transformations of an extended existential universe of human being, and 



target values created in step 
statistics are specified, the 
and second target values in the 



comprises means for transforming, representing, enbodying, translating 
and realizing a plurality of universal forms. These universal forms 
comprise universal objects ^n^ the form of physical embodiments of 
universal knowledge strucSiires, The U.M. comprises a plurality of 
epistemic instances comprising the universal objects and universal 
transformations of those universal objects, expressed in a universal 
grammar, which allows all human knowledge to be enabling media for the 
U.M. 

French Abstract 

Une machine epistemologique universelle (M.U.) permet de creer des 
formes de vie synthetiques arbitraires ( c ' est-a-dire des machines 
pensantes) connues sous le nom d^androides qui connaissent et percoivent 
le monde comme le font les etre humains. La M.U. integre des 
transformations' d' un uiiivers * existentiel etendu d'etres humains et 
comprend des moyens permettant de transformer, representer, integrer, 
traduire et realiser une pluralite de formes universelles . Ces formes 
universelles comprennent des objets universels se presentant sous forme 
de representations physiques de structures de connaissances universelles. 
La M.U. comprend une pluralite d' instances epistemiques comprenant ces 
objets universels et les transformation universelles de ces objets 
universels, exprimees dans une grammaire universelle qui permet a toute 
la connaissance humaine d^^ce un support d* integration pour la M.U. 

Full text Availability: 
Claims 

Claim 

real universe, and indeed of the physical universe, must then be the 
same form that symbolizes the semantic origin of all forms of language, 
or meaning itself, including the meanings of forms known in the... 
universe can be construed from the two perspectives of intrinsic and 
extrinsic forms, the unified theory develops two suitable 
representations of the causal element to reflect an emphasis on either 
viewpoint. When' we represent' the enabler 'si . .in language, as opposed' to 
some other real form, such as a hand motion of a non- syxnbolic nature. 
Though this modality of existence is taken up in greater depth in chapter 
five, in any. . , 
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English Abstract 

The present invention is directed to a word game wherein words are 
formed by arrangement of letters of the alphabet appearing on the faces 
of a set of dice wherein each letter represented and the number of times 
the letter is represented in the set of dice, approximates the frequency 
of their appearance in a corpus of words. 

French Abstract ^ 4 

L* invention concerne un .Jfeu de vocabulaire dans lequel les mots sont 
formes a partir des lettres de 1' alphabet apparaissant sur les faces d'un 
ensemble de des, chaque lettre representee et le nombre de fois ou la 
lettre est representee dans 1* ensemble de des correspondant a leur 
frequence d* apparition dans un corpus de mots. 

Full text Availability; 
Detailed Description 

Detailed Description 
. . . Verbal 

Learning, Chicago: Lippincott/ Mayzner; "M. S . and M.E. Tressect". 1965*. 
Tables of single letters and bigram frequency counts for various 
word-length and letter-position combinations. Psychonomic Monograph 
Supplements , Vol . 
I , No , 2 , pp . . . 
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ABSTRACT EP 247395 A2 

To translate Japanese having no morphological distinction between 
singular and plural forms into English having a morphological distinction 
between the two, for instance, the dictionary unit (6) includes semantic 
information indicative of the plural number, and the translation unit (5) 
syntactically and semantically translates Japanese into English as 
follows: a Japanese sentence* a?s morphologically analyzed into basic ♦ 
morphemes by an inflection information dictionary (6a) ; lexical syntactic 
information of the Japanese basic morphemes are retrieved from a 
word/phrase dictionary (6b); the Japanese sentence is syntactically 
analyzed into an intermediate structure by Japanese syntactic analysis 
grammar to clarify modification relationship between two words; the 
Japanese intermediate structure is semantically transferred into an 
English intermediate structure by structure transfer grammar; a concept 
data of a noun (e.g. "book''^ ^which includes FEATURE : NUMBER = PLURAL is 
formed when the noun is modified by an adjective (e.g. "many") indicative 
of the plural number; an 'English sentence structure is syntactically 
generated from the intermediate structure and by English syntactic 
generation grammar; and lastly an English morphemes are generated by 
morphological generation grammar to change "book" into "books". 
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...SPECIFICATION of a cursor to a character unit or a word unit, etc. 

Fig. 3 shows an exemplary layout on a display image of the display 
unit 8, in which the image is divided into three, edition area, 
original (1st language) display area, and translation (2nd language) 
display area... -^i^V 

...In other words, the dictionary includes information indicative of 
"plural". Therefore, when the sentence is analyzed syntactically so 
that the relationship between a noun and a modifier is clarified, it is 
possible to determine the noun as plural... 
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Main International Patent Class: G06F-017/30 
Publication Language: English 
Filing Language: English 
Fulltext Availability: 

Detailed Description 

Claims 

Fulltext Word Count: 4112 
English Abstract 

A system and a method for processing electronic documents are described 
in which an input document Dl and reference data D2 are examined whether 
there is a content-based relation between the input document Dl and the 
reference data D2 . For the case of a content-based relation, a type of 
linkage is selected from a number of predefined types of linkages in 
accordance with the type of content-based relation and a respective 
linkage between' the documents'"is established. The invention makes it 
possible for the type of relation between two documents to be recognized 
automatically. For example, a document flow can be segmented in suitable 
manner and classified and also stored in an appropriately interlinked 
manner. 

French Abstract 

L' invention concerne un systeme et un procede de traitement de documents 
elect roniques, dans lesquel^ Ain document d' entree Dl et des donnees de 
reference D2 sont examinees afin de determiner s*il existe une relation 
de contenu entre le document d' entree Dl et les donnees de reference D2. 
Dans le cas ou une telle relation existe, un type de lien est selectionne 
a partir d'un certain nombre de types predefinis de liens selon le type 
de relation de contenu, et un lien respectif est etabli entre les 
documents. L* invention permet de reconnaitre automatiquement le type de 
relation existant entre deux documents. Par exemple, un flux de documents 
peut etre segmente de maniere appropriee, classe et aussi stocke de 
maniere appropriee selon le lien etabli entre les documents. 
Legal Status (Type, Date, Text) 

Publication 20040429 A2 Without international search report and to be 

republisKed upon receipt of that report. 

Main International Patent Class: G06F-017/30 
Fulltext Availability: 
Claims 

Claim 

... be a second document. The reference data may also be a group (cluster) 
of documents or a represen^art:ion thereof. On the basis of the analysis 
a decision is made whether^ there is a content-based relation. 
Subsequently, the sort of this relation is determined and attempts are 
made to assign this to a type. For this purpose a number of possible... 

...review of the book document B" . A contentbased relation is a decisive 
factor here, which relation is determined by the type of linkage. Such 
linkage preferably has a fixed direction. An example for a cluster... 

...of documents. For detennining the type of content-based relation one may 
then fall back on the determined type of document. 

A further embodiment of the invention provides that the input document 
comprises a text... 

. . .be 

elucidated with reference to the embodiments described hereinafter. 
In the drawings: 

Fig. 1: shows in a syxnbolic representation linkages between three 
documents; Fig. 2: in a symbolic representation elements of an 



information processing system. Fig. 1 shows in a symbolic representation 

the three documents Dl, D2 and D3. 
In the present example the document D2 is a video... 

...type "comment on". The linkage is set and points from document DI to 
document D2 . It thus indicates as content-related link between DI and 
D2 that the content of D 1 is a comment... 

...be a second document. Siftiillrarly, the reference data may also be a 

cluster of documents or a representation thereof, respectively. If no 
content-based match between the input document and the reference data is 
found. , . 

...linkages in a table. Moreover, the information contained in the data 
portion of the input document is assessed • The results of the search 
for key phrases and the additional information from the data portion of 
the input document are assessed to select a type of linkage. A linkage 
of the selected type of linkages is generated between... 

...in the two documents is -established, where n is suitably selected. A 
vector distance may then be determined which may be regarded as a 
parameter for content-based relation between the documents . Such 
techniques are. . . 

...individual comparisons. For perfon-nance reasons, however, the document 
may also be compared with one or more representations of the cluster. 
Such representations condense common matters of the documents of the 
cluster. If, for example, the^ word frequency method defined above is 
worked with, a representation of a cluster comprises a list of terms 
recurring in the documents of the cluster. 
In the. . . 

...in the 1 5 database (generic terms: general; special terms: special) and 
thus a suitable or numerical measure can be formed for the degree of 
specialization of the terms used. If, for example, it is... 

...constant flow. The data flow D 1 ... Dn may finally also be the result 
of a constant assessment or the result of the transmissions of various 
news stations. The documents D I ... Dn are first... 

...in the data memory 12. If there is a content-based relation, its type is 

detennined, as indicated above and a respective linkage L is 
established. The currently processed document and all the linkages L. . , 
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English Abstract ' 

A system (1480) filters -documents in a document set retrieved from a 
document store in response to a query. The system (1480) obtains (1830) a 
first set of logical forms based on a selected one of the query and the 
documents in the document set. The system (1480) obtains a second set of 
logical forms based on another of the query and the documents in the 
document set. The system (1480) then uses natural language processing 
techniques to modify (1832, 1834) the first logical forms to obtain a 
modified set of logical forms. The system (1480) filters (1836) documents 
in the document set based on a predetermined relationship betwwen the 
modified set of logical forms and the second set of logical forms. 

French Abstract 

L'invention concerne un systeme (1480) qui filtre des documents dans un 
ensemble de documents recuperes a partir d*une mise en memoire de 
documents suite a une requete. Le systeme (1480) obtient (1830) un 
premier ensemble de formes logiques tenant compte d*une des requetes 
choisies et des documents se trouvant dans 1* ensemble de documents. Le 
systeme (1480) obtient un second ensemble de formes logiques tenant 
compte d*une autre des requetes et des documents se trouvant dans 
1* ensemble de documents. ^gWysteme (1480) utilise ensuite les techniques 
de traitement du langage 'naturel pour modifier (1832, 1834) les premieres 
formes logiques pour obtenir un ensemble modifie de formes logiques. Le 
systeme (1480) filtre (1836) les documents contenus dans 1* ensemble de 
documents en fonction d'un rapport predetermine entre 1* ensemble modifie 
de formes logiques et le second ensemble de formes logiques. 

Main International Patent Class: G06F-017/30 
Fulltext Availability: 
Detailed Description 

.Detailed. Description ... 

This 

lexicon includes various classes of words, such as, 
e.g., prepositions, conjunctions, verbs, nouns, 
operators and quantifiers that define syntactic and 
semantic properties inherent in the words in an input 
string so that a... 

...such as logical form 

triples or logical form g;;g^rf within a logical form, 
capable of portraying a 'semantic relationship ) can be 
preccmputed, while a corresponding document is being 
indexed, and stored, within, e.g., a record... 
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English Abstract 

Apparatus and accompanying methods for an ' information retrieval system 
that utilizes natural language processing to process results retrieved 
by, for example, an information retrieval engine such as a conventional 
statistical-based search engine, in order to improve overall precision. 
Specifically, such a search ultimately yields a set of retrieved 
documents. Each such document is then subjected to natural language 
processing to produce a set of logical forms. Each such logical form 
encodes, in a word- relation-word manner, semantic relationships, 
particularly argument and a^j^inct structure, between words in a phrase. A 
user-supplied query is analyzed in the same manner to yield a set of 
corresponding logical forms therefor. Documents are ranked as a 
predefined function of the logical forms from the documents and the 
query. Specifically, the set of logical forms for the query is then 
compared against a set of logical forms for each of the retrieved 
documents in order to ascertain a match between any such logical forms in 
both sets- Each document that has at least one matching logical forms is 
heuristically scored, with each different relation for a matching logical 
forms being assigned a different corresponding predefined weight. The 
score of each such document is, e.g., a predefined function of the 
weights of its uniquely matching logical forms. Finally, the retained 
documents are ranked in order ^of descending score and then presented to a 
user in that order. 

French Abstract 

Appareils et precedes associes, pour un systeme de recherche 
d' information utilisant le traitement en langage naturel pour traiter les 
resultats extraits, par exemple, par un moteur d'extraction d' information 
comme un moteur de recherche a base statistique classique, afin 
d'ameliorer la precision gl^ale. Ladite recherche permet notamment de 
produire en final un ensenfele de documents extraits. Chaque document est 
ensuite soumis a un traitement en langue naturelle de sorte qu*un 
ensemble de formes logiques soit produit. Chaque forme logique code, en 
mode mot-relation-mot, les relations semantiques, notamment la structure 
d* argument et d' adjonction, entre les mots d*une phrase. Une demande 
formulee par 1 'utilisateur est analysee de la meme maniere de sorte qu'un 
ensemble de formes logiques correspondantes soit produit. Les documents 
sont classes en fonction, de maniere predeterminee, des formes logiques 
provenant des documents et de la demande. Specif iquement, 1* ensemble de 
formes logiques pour la demande est ensuite compare a un ensemble de 
formes logiques pour chacun des documents extraits, de maniere qu'un 
appariement soit etabli entre chaque forme logique des deux ensembles. 
Chaque document qui presente au moins une forme logique appariee est 
evalue de maniere heuristique, un poids predefini different et 
correspondant different etant attribue a chaque relation differente pour 
une forme logique appariee. L* evaluation de chaque document est fonction, 
par exemple, de maniere predeterminee, des poids de ses formes logiques 
appariees uniques. Les documents retenus sont ensuite classes dans 
l*ordre decroissant puis presentes a un utilisateur dans cet ordre. 
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Detailed Description 

This lexicon includes va);i^us classes of words, such 

as, e.g., prepositions, conjunctions, verbs, nouns, 

operators and quantifiers that define syntactic and 

semantic properties inherent in the words in an. input 

string so that a. . . 
...such as logical form 

triples or logical form graph within a logical form, 

capable of portraying a semantic relationship ) can be 

precomputed, while a corresponding document is being 

indexed, and stored, within, e.g., a record... 
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Claims 

Fulltext Word Count: 12812 W 
English Abstract 

A computer system for creating a semantic object model from an existing 
relational database schema. The computer system analyzes the catalog 
information of the relational database schema and creates a semantic 
object for each table defined in the catalog. For each column defined 
with a table, a simple value attribute is added to the semantic object 
created for the table. The system then analyzes the relationship 
information stored in the catalog to create object link attributes that 
define relationships between two or more semantic objects as well as to 
create multivalued group attributes and multivalued, simple value 
attributes . If the database catalog does not include the relational 
information, the user is prompted to indicate related semantic objects. 

French Abstract 

Cette invention concerne un systeme informatique permettant la creation 
d'un modele d'objets semantiques a partir d'un schema de base de donnees 
relationnelles existante. Ce systeme informatique analyse les 
informations du catalogue du schema de la base de donnees relationnelles 
et cree un objet semantique^p^our chaque table definie dans le catalogue. 
Pour chaque colonne definie a I'interieur d'une table, on ajoute un 



attribut de valeur simple a I'objet semantique cree pour cette table. Le 
systeme analyse alors les informations relatives aux relations et 
stockees dans le catalogue pour creer des attributs de liaison d^objets 
qui definissent les relations entre au moins deux objets semantiques 
ainsi que pour creer des attributs de groupes a valeurs multiples et des 
attributs de valeurs simpieS'/ a valeurs multiples. Si le catalogue de la 
base de donnees ne contient pas les informations relationnelles , 
l*utilisateur recoit un message lui demandant d'indiquer les objets 
semantiques associes . 

Main International Patent Class: G06F-017/30 
Fulltext Availability: 
Claims 

Claim 

causes the 

central processing unit to pec^form the - following functions: 

a) to analyze the database catalog to determine each relational 
database table defined in the existing relational database schema 

b) to create a semantic object... 

...relational database table corresponding to the semantic object created; 

c) to analyze each column defined in the relational database 
and 

semantic object created that corresponds to a column defined for the 
relational d) to 
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English Abstract 

An improved method for linking images at program activation is provided 
by use of a symbol vector in a sharable code image • The symbol vector 
is automatically constructed which the linker and operating system use to 

• effect "fast looJcup of symbol* values at-program activation^ thus providing 
flexibility similar to that of link-time binding. For each sharable 
image being constructed, the programmer provides a list of symbols which 
are to be made visible outside of the image . These symbols may be 
procedure names, data cells, absolute values, or any other valid use of a 
symbolic value. The order of this list must remain fixed from one image 
build to the next. From this list, the "symbol vector" is constructed (as 
by the linker) of the value of each of the identified symbols, and the 
symbol vector is associated, with the sharable image . A symbol table is 
also associated with the s^aarable image , where each symbol has the 
value of its index in the symbol vector. When resolving references to 



6ther images , the linker does a symbolic lookup in the symbol table 
of the target image and obtains the index into t the target symbol 
vector. That index is bound into the calling image . Then, at program 
activation, the image activator uses the index bound into a calling 
image to obtain the current value of the symbol in the target image . 

.French Abstract. , ... 

Precede ameliore de liaison d* images se faisant- au moment du 
declenchement d'un programme utilisant un vecteur de symbole d'une image 
de code partageable. Le vecteur de symbole se construit automatiquement, 
ce que I'editeur de liaison et le systeme d ' exploitation utilisent pour 
effectuer une recherche rapide des valeurs de symbole au declenchement du 
programme, realisant de ce fait, une souplesse semblable a celle d*une 
edition de liaison temporelle. Le programmateur fournit, pour chaque 
image partageable construite", une liste de symboles devant etre rendus 
visibles en dehors de 1 * iiQaV^. Ces symboles peuvent etre des noms de 
procedures, des cellules 'de donnees, des valeurs absolues ou toute autre 
utilisation valable d'une valeur symbolique. L'ordre de cette liste doit 
rester fixe d*une image a 1* autre. On construit, a partir de cette liste, 
le "vecteur de symboles" (comme par I'editeur de liaison) de la valeur de 
chacun des symboles identifies et le vecteur de symbole est associe a 
1 ' image partageable. Une table de symboles est egalement associee a 
1 ' image partageable, dans laquelle chaque symbole a la valeur de son 
indice dans le vecteur de symbole. Quand il resout des references a 
d*autres images, I'editeur de liaison effectue une recherche symbolique 
dans la table de symboles de 1* image cible et introduit 1' indice dans le 
vecteur de syjnbple de la .cibj.e^. C^t indice. est edite dans.l'im^ge 
d'appel. Ensuite, au moment du declenchement du programme, le declencheur 
d' image utilise 1' indice edite dans 1 * image d'appel pour obtenir la 
valeur courante du symbole dans 1' image cible. 

Main International Patent Class: 606F-009/445 

Fulltext Availability: 
Detailed Description 

English Abstract 

An improved method for linking images at program activation is provided 
by use of a symbol vector in a sharable code image . The symbol vector 
is automatically constructed which the linker and operating system use to 
effect fast lookup. . . 

...values at program activation, thus providing flexibility similar to that 
of link-time binding. For each sharable image being constructed, the 
programmer provides a list of symbols which are to be made visible 
outside of the image • These symbols may be procedure names, data cells, 
absolute values, or any other valid use of a symbolic value. The order of 
this list mus.t remain fij^ed f;;om.Qne image build to the next. From, this 
list, the "symbol vector" is constructed (as by the linker) of the value 
of each of the identified symbols, and the symbol vector is associated 
with the sharable image . A symbol table is also associated with the 
sharable image , where each symbol has the value of its index in the 
symbol vector. When resolving references to other images / the linker 

does a symbolic lookup in the symbol table of the target image and 
obtains the index into t the target symbol vector. That index is bound 
into the calling image . Then, at program activation, the image 
activator uses the index l^Wd into a calling image to obtain the 
current value of the symbol in the target image . 

Detailed Description 

... has the value of its index in the symbol vector. When resolving 

references to other images, the linker does a symbolic lookup in the 

symbol table of the target image and obtains the index into the target 

symbol vector. That index is bound into the calling image . Then, at 

program activation, the image activator uses the index bound into a 
calling image to obtain the current value of the symbol in the target 
image • 



Because of the design of the advanced RISC 'instruction set, and in some 
cases because of the ... ongoing entries 24 must be in the same original 
position. When resolving references to other images^ the linker does a 
symbolic lookup in the symbol table 21 of the target image / finding an 
entry 22., and obtains the index into the symbol vector 20 for this 
target image I 1; that index (ordinal number) for that image name 1 1 
is bound into the calling image *10" at 'the calling position 12 . At 
program acdvation, the image activator routine uses the index bound 
into the calling image 10 to obtain the current value of the symbol in 
the target image 11, by referencing the header 23 to get the pointer to 
the symbol vector 20 and indexing into the symbol vector to the entry 24 
which contains the actual offset into the image 1 1 where the procedure 
13 is to be found. The symbol vector 20 does not become... 
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ABSTRACT 

PROBLEM. TO BE S.OLVED:. To , solve, the problems that, if a statistical language 
model is formed from a language corpus 201 having an extremely high sparse 
characteristic, there is a tendency to inclusion of words having a high 
unigram frequency as the recognition result of speech using the same 
and a recognition error is liable to occur. 

SOLUTION: This system is provided with word connection probability 
calculating means of inputting a backoff probability to a monotonously 
increasing function and determining the result of the computation by this 
function as a fresh conditional probability of a word chain in making 
calculation by subjecting^ ^ th'e conditional probability of the word chain 
having a low appearance frequency to backoff smoothing. 
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GENERATING METHOD, TEXT READING ARRANGING DEVICE, TEXT READING ARRANGING 
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ABSTRACT 

PROBLEM TO BE SOLVED: To eliminate a deficiency of reading-arranged 
character bigram frequency data by calculating the distance between a 
KT^NJI (Chinese character) string and a reading character string by using a 
single-KANJI reading table and finding the correspondence between the KANJI 
string and the readings by the characters so that the sum of calculated 
distances become minimum. 

SOLUTION: An inter-element distance calculating means 1 calculates an 
inter- . constitution-element <;listance as the distance between the KANJI 
string as constitution elements of notation and the character string as 
constitution elements of readings by using the single-KANJI reading table 
wherein KANJI and a candidate for the reading of the KANJI are recorded 
corresponding to each other. Then a minimum distance correspondence search 
means 2 finds the correspondence between the KANJI string constituting the 
notation and the readings by the characters constituting the notation by 



u'sing a dynamic prograituriing' method 66 that the sum inter-cohstitution- 

element distances becomes minimum. Consequently, a deficiency of 

reading-arranged bigram frequency data can be eliminated. 
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Session search system for search engines, has client module updating 
search result set immediately when new data is availeUdle as long as 
search query is active, and server module terminating query when that is 
not active 

Patent Assignee: INT BUSINESS MACHINES CORP (IBMC ) 

Inventor: EMENS M L; KRAFT R; YIM P C 

Number of Countries: 001* Number of Patents :' 001 

Patent Family: 
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Patent Details: 
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Abstract (Basic) : US 6633867 Bl 

NOVELTY - The system has a client module presenting a dynamic 
search result set to a user and a server module managing multiple 
active session queries . The client module updates the search result 
set automatically and immediately when new data is available as long as 
a search query is active, and the server module terminates the session 
query when that is no longer active. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following: 

(a) a computer program product for use with a search engine to 
query a dynamic data repository of the search engine when prompted* by a 
session query from a user 

(b) a method of querying a dynamic data repository of the search 
engine when prompted by a session query from a user. 

USE - Used for querying dynamic and large data repositories, and 
indices of Internet search engine providers. 

ADVANTAGE - The system enables the users to easily and conveniently 
perform a search query similar to the ad-hoc, without the need to 
subscribe or setup and manage a persistent query, and the session query 
is performed automaticairiy, without special user intervention. 

DESCRIPTION OF DRAWING (S) - The drawing shows a flow chart that 
depicts operation of a session search system, 
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Digital image retrieval method involves estimating semantic 

correlation between image pairs with respective bigram frequency , 

obtained through multiple search sessions 
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Abstract (Basic) : US 20030187844 Al 

NOVELTY - Semantically relevant images are determined from 
several images received through ' multiple search sessions / using 
relevance feedback. Semantic correlation between pair of images 

with respective bigram frequency , is estimated . 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following: 

(1) computer readable medium for storing image retrieval program; 

and 

(2) computing device for image retrieval. 

USE - For retrieving di^gital image by content-based or 
keyword-based technique {is ing statistical bigram correlation model. 

ADVANTAGE - Semantic relationships between the images are 
captures easily. The iterative results of content based image retrieval 
(CBIR) system, is improved. Also performance of the system is improved. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
host computer using bigram correlation model, for image retrieval, 
processor (202) 
system memory (204) 
program molecules (206) 
program data (208) 
search engine (210)^ 
image data (222) 
display (230) 
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Face direction generation method for face recognition and gesture 
detection, involves determining quantitative face direction of face in 
image dependent upon rotation and tilt of face 
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Abstract (Basic) : SG 91841 Al 

NOVELTY - The rotation of face in image is computed dependent upon 
a nose axis of face, and the tilt of face in image is computed. The 
quantitative face direction of face in image is determined dependent 
upon rotation and tilt of face. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following: 

. (1) A quantitative ,face direction .generation apparatus; 

(2) A computer readable medium storing program for generating 
quantitative face direction; 

(3) A facial gesture establishment method; 

(4) A facial gesture establishment apparatus; 

(5) A computer readable medium storing program for establishing 
facial gesture; 

(6) A visual mouse presentation method for detecting quantitative 
face direction; 

(7) A visual mouse ^Jas^entation apparatus for detecting 
quantitative face directions; 

(8) A computer readable medium storing for detecting quantitative 
face direction; 

(9) A video conference method; 

(10) A video conference apparatus; and 

(11) A computer readable medium storing program for video 
conferencing. 

USE - For recognizing faces from single image and for determining 

facial gestures. 

ADVANTAGE - Enables fast and easy face recognition by using a 
single image. Enables active system to be built with the faces only in 
given position, prior to face recognition, so that the size of the 
database is reduced and the number of error and the processing time 
during recognition is also reduced. Provides qualitative and 
quantitative measure of the face direction and the result is 
displayed on the face, by drawing the estimated nose axis, hence 
easily interface with supervised learning system and allows a human to 
input feedback. 

DESCRIPTION OF DRAWING (S) - The figure shows a symbolic diagram 
of correlation window^^r'responding to segment (x,theta) for two 
images , being a frontal case and a right rotated case at a face. 
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Abstract (Basic) : WO 200140997 Al 

NOVELTY - All interactions in a chat server are recorded in a 
history database at a computer center and incoming queries are 
processed by a multiple - session query parser (115) for possible 
matches stored in a knowledge base (117) . Matching queries are routed 
to the appropriate agent by a router application (123) and the matching 
queries are also displayed in a window (125) with the most accurate 
response being selected an4 sent to an editing window (127), to allow 
for editing. ^ 

DETAILED DESCRIPTION - AN INDEPENDENT CLAIM is included for a 
system for aiding agent hosts in a communication system. 

USE - Auto-assisting agents participating in agent-led chat 
sessions . 

ADVANTAGE - Quick response to many frequently asked questions. 
DESCRIPTION OF DRAWING (S) - The drawing shows a communication 

network 

Query parser (115) 
Knowledge base (117) 
Router application (123) 

Display window (125-) * 

Editing window (127) 
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